Splitting Complex English Sentences

نویسندگان

  • John Lee
  • J. Buddhika K. Pathirage Don
چکیده

This paper applies parsing technology to the task of syntactic simplification of English sentences, focusing on the identification of text spans that can be removed from a complex sentence. We report the most comprehensive evaluation to-date on this task, using a dataset of sentences that exhibit simplification based on coordination, subordination, punctuation/parataxis, adjectival clauses, participial phrases, and appositive phrases. We train a decision tree with features derived from text span length, POS tags and dependency relations, and show that it significantly outperforms a parser-only baseline.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Structural Organization of Modern English Multiple Complex-Compound

The article focuses on the factors that cause linear and vertical sentence extension of multiple complex-compound sentences used in English fictional literature. Considering the sentence structure as a combination of 2 Units – paratactic and hypotactic the authors define the structural peculiarities of paratactic and hypotactic units including the number of clauses and its bonds. The extension ...

متن کامل

Boosting Unsupervised Grammar Induction by Splitting Complex Sentences on Function Words

The statistical-structural algorithm for unsupervised language acquisition, ADIOS (for Automatic DIstillation Of Structure), developed by Solan et al. (2005), has been shown capable of learning precise and productive grammars from realistic, raw and unannotated corpus data, including transcribed children-directed speech from the CHILDES corpora, in languages as diverse as English and Mandarin. ...

متن کامل

A Multilingual Method for Clause Splitting

This paper addresses the clause splitting problem and proposes a multilingual method for detecting clause boundaries in unrestricted texts. The method combines language independent machine learning techniques with language specific rules in order to take the first step in building the hierarchical structure of sentences. The results of a machine learning algorithm, trained on an annotated corpu...

متن کامل

An Investigation into the Effective Factors in Comprehending English Garden-Path Sentences by EFL Learners

The present study aimed at highlighting the possible effects of age, proficiency level, and the structural composition of Garden-Path (GP) sentences on EFL learners' comprehension. 80 Iranian EFL learners were recruited from the initial pool of 114 participants based on the results of an English proficiency test; 40 advanced, and 40 intermediate learners were selected. Moreover, two age...

متن کامل

Presupposition Role in the Compound-Complex Sentence

The article analyzes the role of presupposition in the compound-complex sentence. The authors examine the types of presuppositions and minimal compound-complex sentence as a field of presuppositions action. The analysis of these types of sentences by the material of the English language shows that several kinds of presuppositions are realized in them – contact or distant, - developing in the re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017